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Abstract — In randomly deployed networks, such as sensor 
networks, an important problem for each node is to discover 
its neighbor nodes so that the connectivity amongst nodes can be 
established. In this paper, we consider this problem by incorpo- 
rating the physical layer parameters in contrast to the most of 
the previous work which assumed a collision channel. Specifically, 
the pilot signals that nodes transmit are successfully decoded if 
the strength of the received signal relative to the interference is 
sufficiently high. Thus, each node must extract signal parameter 
information from the superposition of an unknown number of 
received signals. This problem falls naturally in the purview of 
random set theory (RST) which generalizes standard probability 
theory by assigning sets, rather than values, to random outcomes. 
The contributions in the paper are twofold: first, we introduce the 
realistic effect of physical layer considerations in the evaluation 
of the performance of logical discovery algorithms; such an 
introduction is necessary for the accurate assessment of how 
an algorithm performs. Secondly, given the double uncertainty 
of the environment (that is, the lack of knowledge of the 
number of neighbors along with the lack of knowledge of the 
individual signal parameters), we adopt the viewpoint of RST 
and demonstrate its advantage relative to classical matched filter 
detection method. 

Index Terms — wireless sensor network, neighbor discovery, 
multipacket reception, random set theory 



I. Introduction 

WIRELESS sensor networks are gaining great attention 
due to its versatility in civilian/military applications 
such as environmental monitoring and target detection/tracking 
in a cost-effective manner In such applications, a large number 
of sensors are randomly deployed over the region of interest 
and, presumably, neighbor discovery is the first and foremost 
process to run after the deployment to form a network whose 
connectivity greatly affects the performance of subsequent 
network operations over the entire life span. The challenge is 
compounded by the fact that neighbor discovery has to be done 
without any a priori knowledge on the random deployment or 
any communication infrastructures. 

Neighbor discovery in wireless networks is defined to be the 
process to identify a set of nodes with which a node can com- 
municate, and it has been addressed by several authors fTl-pl. 
In im, a simple ALOHA-like neighbor discovery algorithm 
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was proposed in which each node randomly transmit/listen 
in each time slot and analyzed for both synchronous and 
asynchronous timing cases. This type of discovery algorithm 
based on the random access protocol is well suited for ran- 
domly distributed wireless networks. In {2], similar neighbor 
discovery algorithm was considered and the expected time to 
find neighbors was obtained. In 13], a gossip-based algorithm 
was proposed in which each node transmits a table of gossip 
data (which is the list of neighbors that it has discovered so 
far and their locations) in a random direction using directional 
antennas. In |4|, a family of probabilistic protocols, called 
birthday protocols, have been proposed to initiate the randomly 
deployed wireless networks. From a physical-layer point of 
view, however, the previous works are extremely limited due 
to the use of collision channel model. Under this model, if 
more than one nodes transmit at the same time, none of them 
are successful. However, it is too pessimistic in the sense that a 
transmission may succeed even in the presence of interference 
which is called capture effect Q-lj^. We, thus, claim that the 
performance of neighbor discovery algorithm has been quite 
underestimated so far due to the use of unrealistic channel 
model, and correct reassessment of the discovery algorithm is 
required. 

In this paper, we consider a shared channel and nodes 
with multipacket reception capability in which a transmission 
is successful if the received signal-to-interference-plus-noise- 
ratio (SINR) exceeds a certain threshold. Specifically, under 
the discovery algorithm proposed in 0], we obtain the expres- 
sion for the expected number of successful receptions per slot 
at a given SINR threshold and find the optimal transmission 
probability which maximizes the expected number of success- 
ful receptions. We note, however, that for a given modulation 
scheme and target bit error rate (BER), the data rate is an 
increasing function of the SINR threshold. Therefore, for a fair 
comparison, we normalize the slot duration as the unit time 
and obtain the expected number of successful receptions per 
second. After that, the performance of the discovery algorithm 
over multiple slots is studied which is useful, for example, in 
deciding the duration of the discovery process. 

In dealing with multipacket reception capability, the phys- 
ical layer signal processing issues cannot be overlooked. 
This is because the received signal at each time slot is the 
superposition of the signals transmitted from random set of 
nodes and noise. Therefore, determining the existence of a 
signal from a particular node itself is not an easy task. For this 
problem, we first present the classical matched filter method 
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which fundamentally treats the interference as noise. As an 
alternative, a more accurate method can be envisioned in an 
additional cost of complexity. Since the number of transmitters 
and their entities are all unknown, we adopt the viewpoint 
of random set theory (RST: see Appendix |A] and references 
therein) and propose RST-based method for detecting the 
transmitting nodes in each time slot l|T0ll - |fT6l . Besides, it is 
also possible with RST to estimate additional parameters of 
transmitted signals such as signal amplitudes and phase. 

This paper is organized as follows. The next section de- 
scribes the basic assumptions, establishes the notation, and 
presents the signal model. In Section |III1 details on the 
neighbor discovery algorithm is presented and the early ter- 
mination of the discovery process is discussed. In Section |IVl 
we analyze the chosen discovery algorithm under the SINR- 
based model. In Section |V] physical layer signal processing 
solutions are delivered. Specifically, we obtain matched filter 
detector for neighbor discovery and, as an alternative, RST- 
based maximum a posteriori (MAP) estimator is proposed. In 
Section |VI] numerical results are presented. Finally, we draw 
conclusions in Section WU[ 



transmits a 1-bit known message, then = {j/t,i, ■■■,yt,L}^ 



II. System Model 

We consider a time-slotted wireless sensor network which 
is deployed over a region of interest such as large tactical 
area for target detection or vast rural area for environmental 
observation. In such scenarios, a large number of sensor nodes 
are released from an airplane. For a large number of nodes 
over a large area, the locations of nodes are modeled by 
a homogeneous, two-dimensional Poisson point process with 
intensity A which is the average number of nodes per unit area. 
Therefore, the number of nodes in a unit area follows Poisson 
distribution with parameter A. As is well known, a spatial 
Poisson process on the plane, conditioned on a given number 
of nodes within a given area, yields the uniform distribution 
of these nodes in that area. As a result, for example, if a 
node's discovery region is modeled as a circle of radius Rq, 
the cumulative probability distribution on the distance from 
the node to the other nodes in the region is given by 



0, 
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(1) 



The signal received by a node at time slot t (if the node is 
hstening) is given by 



fcGit 



(2) 



where It is the set of transmitting neighbors, g^' is the complex 
amplitude of the signal received from the fc-th node, is the 
signal transmitted from the fc-th node which is the message 
multiplied by the signature sequence bitwise, and rit is a 
random noise. We assume that the signatures of all nodes are 
known to each other by assuming that they share an identical 
key generator which can be implemented using a linear feed- 
back shift register. If the length of signature is L and each node 



}^ which is equal to the signature sequence. 



and rit = {nt i, ...,nt.L}'^ , where the symbol T denotes 
vector transpose. The noise samples in rit are assumed to be 
independent and identically distributed (i.i.d.) with the normal 
distribution A/'(0, N) where N is the noise power which is the 
product of noise spectral density A'o and bandwidth B. The 
complex amplitude is of the form — ^ G(l + rk)~^tjjt, 
where G is the transmission power, is the distance from the 
fc-th node to the receiver, rj is the path loss exponent, and 
is the channel fading coefficient which is modeled by i.i.d. 
standard circular symmetric Gaussian random process |17|. 

Denote by J' and J the set of neighbors and the number 
of neighbors of a node, respectively, and we suppressed the 
particular node index for notational brevity. The SINR of node 
fc at time slot t (if the fc-th node is transmitting) is given by 



SINR^ = 



E 



Prx.^ + N 



where Prx,k is the received signal power from the fc-th 
transmitter. Note that the distances (fc = 1, . . . , J) are i.i.d. 
under the Poisson point process modeling. Also the channel 
fading coefficients t/j^ are i.i.d., and by further assuming 
that Tfe and ip^ are mutually independent, the received signal 
powers Prx.k {k = 1, . . . , J) are i.i.d. as well, and the common 
cumulative distribution function of the received signal power 
is obtained by Q 



— ) - 1 I f\4,\2{uj)duj (3) 



Fp{x) = 1 - 



where /|^|2(-) is the probabiUty density of the squared magni- 
tude of the stationary fading process which is an exponential 
with unit mean. The transmission of the fc-th node is said to 
be successful if 

(4) 



SINRf > T 



where the threshold r depends on parameters such as data rate 
and target BER [18] . Note that, if r < 1, it is possible for two 
or more signals to simultaneously satisfy Q, and if r > 1, at 
most one signal having the highest SINR may satisfy (|4|l. If t 
goes to oo and we neglect the noise effect, the criterion in (HI 
is equivalent to the collision channel model, and if r goes to 
0, all transmissions will be successful, but at the same time, 
the data rate goes to 0. 

III. The Neighbor Discovery Algorithm 

A. Description of the Neighbor Discovery Algorithm 

An ALOHA-like neighbor discovery algorithm proposed in 
[IJ is reconsidered in which each node transmits with proba- 
bility pt or listens with probability 1 — px- The transmission 
probability px and the transmission power G are identical 
for all nodes; all these simplifying assumptions were made to 
reduce non-essential complexities. Notice that a plain method 
such as the periodic beaconing can be thought but such a static 
approach would certainly fail in a randomly deployed network 
which has irregular node density. Under the chosen discovery 
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algorithm, in order for a node to be discovered by some other 
node, the former should transmit and the other should listen 
at the same time. On top of that, the transmission must be 
successful which requires a certain criterion; in the collision 
channel, the criterion is that there is only one transmission 
at that time slot. However, we adopt a more realistic SINR 
criterion for success and optimize the transmission probability 
which has been quite underestimated so far due to the use of 
collision channel model. 

B. Considerations on Early Termination 

Obviously, the marginal revenue of running the discovery 
algorithm diminishes as time goes and, because sensors are 
usually assumed to be battery-powered, it is needed to consider 
an early termination of the discovery process to prolong the 
lifetime of the network. Several metrics can be envisioned as 
a criterion for the early termination. A simple criterion is to 
terminate when a node has discovered a predetermined number 
of neighbors. In the case of nodes located at the sparsely 
populated areas, however, this may not be satisfied until the 
end of the discovery process (even if all the neighbors are 
discovered early). Alternatively, one may want to allow an 
early termination if a node has discovered a predetermined 
fraction of neighbors. However, because each node does not 
know a priori how many neighbors it has, this criterion is 
untestable. As will be seen in the next section, as time goes, the 
set of discovered neighbors in each time slot will overlap with 
the previously discovered ones. Therefore, if a node do not find 
any new neighbors for a sufficiently large number of slots, then 
it can be regarded that all the neighbors have been discovered. 
Hence, an early termination can be declared if a node does not 
receive any new messages during a predetermined number of 
time slots. 

IV. Analysis of the Discovery Algorithm with the 

MULTIPACKET RECEPTION CAPABILITY 

A. Optimal Transmission Probability 

We start by deriving the expected number of successful 
receptions in each time slot as a function of SINR threshold r. 
After that the transmission probability pT is set to maximize it. 
We first denote by If (C Ij) the set of transmitting neighbors 
satisfying the SINR criterion in (HJi. Then, the expected number 
of successful receptions by one node is expressed a^ 

J 

E[\rt\] = Pr {anode is listening} ^ Pr{|It I = n} Sn 



n=l 



ELbT"(i-PT) 



n=l 



J-n+1 



Sri 



where Sn is the expected number of successful receptions 
given n{> 1) simultaneous transmissions and is obtained by 

S„ = nPr { SmRl > t | |If | = n, 1 e I* } (6) 

where SINR^ is the SINR of the first transmitter Q. Note 
that (|6]l follows from the assumption that the received signal 

' I ■ I is the cai'dinality of a set. 



powers Prx.k {k = 1,...,J) are i.i.d. and, thus, the first 
transmitter needs not be the closest one to the receiver It is 
computed as 

Pr{SINR,i>T| |lt|=n,lelt} 

pOO pOO I \ 

= 1-/ ••■ / Fp T^a;JdFp(x2)---di^p(x„) 

(7) 

where the noise effect was neglected for simplicity. Conse- 
quently, in principle, we can find the optimal transmission 
probability that maximizes the expected number of successful 
receptions. 

Example 4.1: As an illustration of the use of above equa- 
tions, consider a simple example in which there are three 
nodes and they are within the radio range of each other Since 
all three nodes have two neighbors, the expected number of 
successful receptions by one node is given by 

E^\l\\ = 2pT(l-PT)'Pr{ SINRi > r | \W = 1,1 G I* } 
+ 2pt'(1 - PT)Pr { SINRi > r | = 2, 1 e I* } 

Let us further consider a simplified path loss model in which 
the received signal power is given by Prx,k 
can be easily shown that for r < 1, 



m\\\ ^ 

and for r > 1, ff 

£;[|if|] = (2- 



T 'J pt 



(2 + rt) 



PT 



2pi 



Then, it 



(8) 



PT 



4 Ut' + 2pt (9) 



Note that (O and ^ are (strictly) concave over the feasible 
region. Thus, differentiating them with respect to px, and 
setting the derivatives to 0, we find the optimal transmission 
probability pt^ which maximizes the expected number of 
successful receptions in each slot as 



Pt 



= < 



r~ + 2 - y (r~ - 1) 
3ti 



if r < 1 



4+ ^(r-^ -1)2 + 3 
3(r"t-2) 



if r > 1 



(10) 



In Fig. [U we plot (fTol i as a function of r with path loss 
exponent ry = 4. It can be seen that as threshold t increases, 
decreases. This is because at a higher threshold, it becomes 
more difficult for multiple transmitters to simultaneously sat- 
isfy the SINR criterion. We can also see that the maximum 
(5) expected number of successful receptions £'[|If|]* decreases 
as the threshold r increases. However, it does not necessarily 
mean that we will have a reduced number of successful 
receptions in a unit time, because the transmission duration 
will also become shorter We will look at this issue in the 
subsequent section. 

Remark 4.1: In |1|, the optimal transmission probability 
under the collision channel model was derived, and was given 
by the inverse of the total number of nodes as in the original 
slotted ALOHA system. Recall that as r goes to cxd, our 
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Fig. 1. Optimal transmission probability and corresponding maximum 
expected number of successful receptions |]* for the three node example 



Fig. 2. The maximum expected number of successful receptions per second 
under the Af -PSK modulation scheme 



model accommodates the collision channel model. In ( fTOt . 
we have = 1 /3 as r goes to oo, which is consistent with 
the quoted result. Therefore, we conclude that the optimal 
transmission probability derived under collision channel model 
is fundamentally assuming the worst prior on the channel 
conditions by neglecting all the possibilities of succeeding in 
the presence of interference. 

B. On the Effect of the SINR Threshold 

For a given modulation scheme and target BER, the data 
rate is an increasing function of the SINR threshold r ifTSl . 
Hence, if the number of bits to be transmitted is fixed and 
we increase r, the slot duration must be shortened due to 
the increased rate. However, as we have seen in the previous 
section, the transmission probability needs to be lowered. 
To investigate this interrelationship, let us consider the M-PSK 
modulation schem^ in which its symbol rate [symbols/sec/Hz] 
at a given target BER z is given by 



Rs 



[Q- 



.2[Q-i(z)] 

2r sin^(7r/M) 



-)] 



M ^ 2(BPSK) 
M = 4(QPSK) 

M = 2",n > 2 



where Q{z) is the probability that the standard normal random 
variable is greater than z. The maximum symbol rate i?'" is 
given by R™ = 1/kg, where kg is the constant that depends 
on the pulse shape of the analog signal. Without loss of 
generality, we set kg = 1 (i.e., the raised cosine pulse with 
roll-off factor of 1). Denote by W the number of bits to be 
transmitted in each time slot. Then, the transmission duration 
is given by Tsiot = ^ m ■ Consequently, we can redraw 
the maximum expected number of successful receptions as in 
Fig. |2] where we set 77 = 4, z = 10~^, W = 1 bit, and 

^We could use the Shannon capacity formula but it would not make practical 
sense since it gives an asymptotic limit of the rate with arbitrarily small 
probability of error and arbitrarily long block length. 



B — 1 Hz, respectively. Note that in the figure, the units were 
changed from [nodes/slot] to [nodes/sec]. 

C. Performance over Multiple Slots 

Because as time goes on, the set of discovered neighbors 
in each time slot will overlap with the previously discovered 
ones, it is important to know the performance of the discovery 
algorithm over multiple slots. We start with a simple scenario 
in which the set of successful transmitters If is independent 
from slot to slot. This is an artificial scenario because the set If 
is correlated in the sense that a node closer to the receiver has 
a better chance of being discovered in any slots. Therefore, the 
scenario where If is correlated over slots will also be studied 
later in this section. 

Assuming that the set of successful transmitters If is 
independent from slot to slot implies that each neighboring 
node has equal probability of success. Therefore, for a given 
number of successful receptions ht and the total number of 
neighbors J, the probability that a certain neighboring node 
belongs to the set If is given by ht/J and, over D multiple 
slots, the probability is obtained by 



Pr fee 



1 



= hr 



n 



1 



J 



(11) 



We call ( fTTT i a slot-basis prediction to distinguish from the 
Bernoulli approximation which will be given in the sequel. 

Because the event that a particular neighboring node belongs 
to the set If can be approximated by the Bernoulli trial with 
success probability i?[|If |]/J, the number of discovering the 
particular node over D multiple slots is a binomial random 
variable with success probability i?[|If|]/J and the total 
number of trials D. For large D and small i?[|If|]/J, it can 
be further approximated by the Poisson random variable with 
parameter DE[\II\]/J HI p. 435]. Hence, the probabihty 



5 




Fig. 3. The fraction of neiglibors discovered by node A for tlie example 
transmit/listen pattern in Table |l] 

TABLE I 

An example transmit/listen pattern of three nodes (the 
letters t and l stand for transmit and listen, respectively.) 



Time slot 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


Node A 


T 


T 


L 


L 


T 


L 


T 


L 


L 


T 


T 


L 


T 


T 


L 


Node B 


L 


L 


T 


L 


T 


T 


L 


T 


L 


L 


L 


L 


T 


T 


T 


Node C 


L 


T 


T 


L 


T 


L 


L 


T 


T 


T 


T 


T 


T 


T 


L 



that the particular node has been discovered over D multiple 
slots is approximately 1 — exp{—DE[\If\]/ J), which is equal 
to the probability that the Poisson random variable is non- 
zero. Note that the slot-basis prediction in ( fTTT ) and the above 
approximation do not depend on the particular node index 
because we assumed that If is independent from slot to 
slot. Therefore, it can be viewed as the predicted fraction 
of neighbors discovered up to time slot D. In Fig. [3] 
we plot the actual/predicted fraction of neighbors discovered 
for the example transmit/listen pattern in Table J] For the 
figure, the SINR threshold r is set to 1 (in linear scale), 
and the transmission probability is set to 0.4226 which 
is the optimal value by (fTOl i with path loss exponent 77 = 4. 
Note that this is the optimal transmission probability which 
maximizes the expected number of successful receptions at 
that threshold value. Consequently, we obtain the expected 
number of successful receptions per slot as 0.3849 by and 
this value was used to plot the Bernoulli approximation. 

In practice, the probability that a particular node belongs to 
the set of successful transmitters If depends on the distance 
between the node and the receiver and, thus. If is correlated 
over time slots. Therefore, we first obtain the conditional 
probability that a particular node at the specific distance from 
the receiver belongs to the set If given the total number of 
successful receptions ht as (see Appendix IbTi 

Pi-{kelt I \It\ = h„rk = r'} 

Et..(t>T"(l-PT)"-"C» 

ELh, OptH^ - PtV-- (f (Cn + 7n - + 

(12) 



Fig. 4. The probability that a node at distance r' from the reference node 
is in the set of successful transmitters for the three node example given the 
number of successful transmissions ht = 1 

where (n. In, and ^„ are defined as 

C« = (^^ 1 .fl,nft7\^ - /2,n)"-'' (13) 
In = ("^ - f2,.r-''-\l - fl.n) (14) 

C«= (^^J/^(l (15) 

and /i „ is the probability that a transmitter located at distance 
r' from the receiver will succeed among n(> 1) simultaneous 
transmissions and /2,„ is the probability that a transmitter at 
an arbitrary distance will succeed among n(> 1) simultaneous 
transmissions given that one of the other transmitter is located 
at distance r' from the receiver. The expressions for /i_„ 
and /2,„ are given in Appendix |B] The function /„ is the 
shorthand notation for Using the conditional probability 
in (fT2l) . the probability that a particular node at distance r' has 
been discovered over multiple slots can be obtained similarly 
with ( fTTI ). To get a better understanding on the conditional 
probability ( fT2] i. let us consider the following example which 
is in line with the Example 14.11 

Example 4.2: Since each node has two neighbors, it 
is obvious that Pr { /c e If | |If | 0, = r' } = and 
Pr { fc e If I |If I = 2, rfe = r' } = 1. For ht = 1, after some 
calculation, we obtain 

Pr{fcelf I |If| = l,r, = /} 

1 - PT + PT ( 1 - ^"'^ ) 

= 7 ,2 \\ (16) 

2(1-pt)+ ((1-r'^) +r'^)pT 

where the discovery range Rq in ([T]i is normalized to 1 and 
the threshold r is set to 1. We plot the result in Fig. |4] where 
all other parameters are set identical with those used for Fig 
[3] Note that even if r' ~ 0, the probability is not equal to 1 
because of the randomness on node's transmission. 
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Fig. 5. Bank of matched filters 



V. Physical Layer Signal Processing: Detection 
OF THE Transmitting Neighbors 

A. Classical Approach Using a Bank of Matched Filters 

To decide the existence of a signal from a particular node, 
we require the use of matched filters in which the outputs 
of the filters are compared to a certain threshold. Note that 
such a decision is subject to probabilistic errors such as the 
false alarm and miss and, thus, the threshold needs to be 
chosen in some optimum way. Since the set of transmitting 
neighbors and their signal amplitudes are all unknown and 
hard to be tracked, we simplify it by assuming that the sum 
of interfering signals and the noise act as another noise process 



}^ whose samples are i.i.d. with A/'(0, N') 



The average noise power N' can be computed as 



N' = N + 



j-i 

E 

ri = l 



J- 1 



PT"(l-PT)'^-'-"r^P, 



(17) 



where J and Prx is the average number of neighbors and 
the received signal power, respectively. Note that this is a 
reasonable approximation if there is a large number of nodes, 
and it is commonly applied to the analysis of cellular code 
division multiple access (CDMA) systems 

Based on the above approximation, we formulate the binary 
hypothesis testing problem for the transmission of a particular 
node k as 

versus (18) 
Hi-.yt^ 9t s^ + n[ 



which is a composite hypothesis testing because the signal 
is known but still not its amplitude. For the problem, 
a generalized likelihood ratio test method would provide a 
simple decision rule a^ [t 20l P- 51] 



1 



L 

TV' E 

1=1 



, ^1 

Vt.isli ^ P 
Ho 



(19) 



This structure is depicted in Fig. |5] where = /37V'. Note 
that each node transmits with probability pT and listens with 

^Note that the decision rule in U9t is optimal for near which holds 
for most of the practical situations. 



probability 1 —pT, and they are the prior probabilities for each 
hypothesis. 

Given the cost structure Cij which is the cost incurred 
by choosing hypothesis Hi when hypothesis Hj is true, the 
optimum threshold for the minimum average cost is given 
by (3 = , where tt.^ is the prior probability 

of hypothesis l3i. Under the minimum-probability-of-error 
criterion where the cost assignment is done by = for 
i — j, and Cjj = 1 for i ^ j, the threshold is obtained by 
13 = tto/tti = (1 ~pt)/pt- 



B. Random Set Theory-based Approach 

1 ) Motivation and Background: The classical approach us- 
ing a bank of matched filters holds a certain desired property; 
that is, its complexity does not scale with the number of 
nodes by averaging out the effect of random interferences. 
However, the performance would be worse than the class of 
decorrelator detectors. On the other hand, total number of 
transmitting neighbors and their entities are all random in our 
problem setting and, thus, standard decorrelator detectors are 
not directly applicable because they fundamentally assume a 
fixed number of transmitters with known entities. This problem 
naturally falls in the purview of RST which generalizes stan- 
dard probability theory by assigning sets, rather than values, 
to random outcomes (see Appendix lAl and references therein). 
RST has been applied before in the context of multi-source 
data fusion and multi-target identification problems ifTOI - lfTsl 
and, recently, multi-user detection problem in a dynamic 
environment lfT6ll . The utility of RST mostly comes from the 
fact that we can readily treat the random behavior of a random 
number of entities as a single random set having likelihood. 

Mathematically, a random set X is defined as a mapping 
from a sample space 17 to a power set P{S) of a hybrid space 
§. It is referred to as a random finite set, if for all ut E il, the 
set is finite (i.e., |X(w)| < oo). The hybrid space S = R'^xU 
is the Cartesian product of a d-dimensional Euclidean space 
M'' and a finite discrete space U. To illustrate the use of the 
RST-based method, we focus on the case where the hybrid 
space § is given by a finite discrete space as § = {1, ...,K}, 
i.e., the random set Xt is simply equal to the unknown set of 
transmitting neighbors at that time slot, which was previously 
denoted by It in (|2]l. Additionally, if the amplitudes of the 
signals from transmitting neighbors are of interest, we can 
defined it as § = {!,..., K} x R+, where M-|_ is the non- 
negative real space. In the following, we estimate the random 
set Xt based on the observed signal yt in each time slot. 

2) Defining the Estimator: Since the number of transmit- 
ting neighbors in each time slot and the total number of 
neighbors are connected, both the random set Xt and the total 
number of neighbors J need to be jointly estimated as 

arg max /x,,j|Yt (Xj, J'|yt) 

where /xt.j|Yt(') is the likelihood of random set Xj and J 
neighbors given the received signal Yj = yt. By Bayes rule, 
fxtJiYt (XJ, J'lyt) is proportional to 

M|x„j(yt|x;,j')/x.|j(x;|J')/j(J') 



7 



where /Yt|Xt.,/(") is the likelihood of received signal Yt given 
Xt = X( and J = J', /x^jlO is the likelihood of random 
set Xt given J = J' , and /,/(•) is the likelihood of having 
J neighbors. Consequently, the joint MAP estimator of the 
random set Xf and the total number of neighbors J is obtained 
by 

arg max /Y,|x„j(yt|X;, J') /x.|,7(X;| J')/j(J') (20) 

(X(,,/') 

In order to run the above estimator, it is required to specify 
the densities /Yt|Xf,j(-)' /xt|j(-)' and /j(-). We outline how 
these densities are obtained. 

3 ) Specifying the Densities: The received signal y t not only 
depends on the set of transmitting neighbors Xf, but also 
depends on their signal ampUtudes gt = {g^,\/k e Xf}. Since 
the amplitudes are all random, we take average by assuming 
that the amplitudes of the nodes are independent of each other 



Xt,,/ 

OO 



D 

/Y.|x„g„j(yt|xUU')rf^s(5i) 



(21) 

where /Yt|Xt.gt,j(yt|XJ, gj, J') is the likelihood of received 
signal Yf given Xf = Xj, gf = gj, and J = J'. The set 
gf = {g[, . . . ,g'^^,^} and Fg{-) denote the realization of gf 
and the common cumulative distribution of the received signal 
amplitude, respectively. Note that by further conditioning the 
received signal yt on the set of transmitting neighbors and 
their signal amplitudes, the only randomness remaining is in 
the noise nf. Therefore, it is given by 



/Yt|Xt,gt,j(yt|x'f,gf, J') 
/ 



1 



(27riV) 



L/2 



exp 



v 



1=0 \ fcex; / J 



where the bijective function m{k) has been introduced to map 
the elements in Xj to the elements in the set {1, . . . , |Xf |}. 
For example, if Xj — {2, 5}, then m{2) = 1 and m(5) = 2. 

In order to obtain /xt|,/(Xj| J'), we first obtain the belief 
mass of a random set Xf for a given number of neighbors 
J — J' as (see Appendix |A]i 

/3x.|j(C| J') = E E Pr{Xf = B|J = J'} 

n=OB:BCC,|B|=n 

where C is a closed subset of the space §, and B is a real- 
ization of the random set Xf. Let us first derive /3xt|,7(C| J') 
for a particular example as follows. 

Example 5.1: Set § = {1,2,3}, C = §, and J' ^ 2. 1) 
The set {B : B C C, |B| = 0} is given by {0} and, thus, 
Ebbcc |B|=oPr{Xf = B|J = 2} = {l-prf. 2) The set 
{B ; B C C, |B| = 1} is given by {{1}, {2}, {3}} and, thus, 
Eb bcc |b|=i Pr{Xt = B| J = 2} = 3pT (1 - Pt). 3) The 
set {B : B C C, 1B| = 2} is given by {{1, 2}, {1, 3}, {2, 3}} 
and, thus, EB:Bcc,|B|=2Pr{Xf = B|J = 2} = 3pT^ 4) 
Since J' — 2, the probability that a random set Xf is equal to 



B for |B| > 2 is zero. Summing over all possible B's yields 

/3x.i,/(c|j') = eLo OPT-{i~PTf-r 

For general cases, we have 



5Xt|J 



(C| J') = 



J' 

E 

n=0 



Pi {I- Pi) 

n ' 



The belief density /xj|j(X(| J') is obtained by taking the set 
derivative of the belief mass obtained above. For the case 
where the hybrid space § is comprised only of the discrete 
space, it can be readily obtained through the following Mobius 
inversion formula as (see Appendix |A]l 

/x,|j(X;|J') = E (-i)i^'\''i/3x,|j(c|j') 

ccx; 

Example 5.2: Take XJ = {1,3} and J' = 2, then the 
set {C : C C X'f} is given by {0, {1}, {3}, {1, 3}}. 1) 
For C = 0, /3x,|./(C|J') - {l-pif- 2) For C - {1}, 
/3x,|j(C|J') = (l-prf +PTil-PT). 3) For C = {3}, 
/3x,|j(Cl J') = (1 - Pif+PT (I-Pt). 4) For C = {1, 3}, 
/3xt|j(C|J') = (1 -pr f + 2pT (1 -Pt) + PT^- Summing 
over all the possible C's by considering the sign of the terms 
yields/x,|,7(X;|J')=PT^ 
Similarly, for general cases, we have 

/x„.,(x;ir) 



(I-PT) 

0, 



for |X;| < J' 
otherwise 



Note that the scenario in which Xf contains only the identity 
of the transmitters is the simplest case that can be solved 
by standard probability theory as well. The usefulness of 
RST comes when we extend the set Xf so that additional 
parameters, such as signal amplitudes, can be estimated at the 
same time. 

The density /j(J') is the probability that there are J' 
number of neighbors. By defining the discovery region as the 
circle of radius Rq, from Section [III the number of nodes 
inside the discovery region follows a Poisson random variable 
with parameter A7ri?o^. Notice that there is no definite way 
of choosing Rq because the decisions on whether a particular 
node is my neighbor are inconclusive due to the continuity of 
signal strength together with the random effect of noise and 
fading. However, as will be shown in the numerical example, 
the estimator gives more weights to an appropriate size of the 
set which is likely to be occurred at that size of discovery 
region. 

VI. Numerical Results 

Since comparing the performance of the discovery algorithm 
under collision channel to that under multipacket reception 
channel is trivial and obvious, we focus on the comparison of 
the physical layer signal processing methods for the detection 
of transmitting neighbors. For the simulation, a total of 8 
wireless sensor nodes are uniformly deployed over the region 
of interest which is modeled as a circle of radius R which is 
set to 1 km as shown in Fig. |6] The reference node denoted 
by index is assumed to be located at the center of the circle. 
For the wireless channel, we consider a simple path loss model 



Fig. 6. 



(a) Discretization of the distance (b) Deployment scenario 1 

Discretization of the distance for fast computation of )2U and example deployment scenarios 



(c) Deployment scenario 2 



which only depends on the transmitter-receiver distance, and 
the path loss exponent rj is set to 4. The transmission power 
G is set to -24 dBrr0, and the noise spectral density Nq is 
set to -173 dBm/Hz as commonly done. The bandwidth B is 
set to 100 Hz (which is artificially low but really irrelevant 
for our purposes here). The transmission probability is set 
to 0.5 (to better observe the effect of the multiple access). 
For the signatures of the nodes, we used length 15 maximal- 
length sequences [22 1. Gold and Kasami sequences which have 
better correlation property can be used, but comparing the 
performance of different codes is beyond the scope of this 
paper 

Note that the computation of fYt\Xt.j{') in (ED is tedious 
because of the multiple integrals. Hence, we transform the 
integrals to a finite summation by discretizing the distance. To 
do this, we first divide the discovery region of the reference 
node into a finite number of strips having the same area and, 
after that, each strip is further divided into two having the 
same area as shown in Fig. |6(a)| By doing so, the probability 
that a node is on either one of those radii becomes uniform. 
For the numerical examples, we used 7 discrete points and, 
obviously, the accuracy will improve as the number of points 
increases. Also note that, since a total of 8 nodes are uniformly 
deployed over the specified region of interest, the density 
/j( ) on the number of nodes inside the discovery region of 
radius Rq can be more accurately described by the binomial 
distribution B{8, ^j^), rather than the Poisson approximation. 
For the classical approach, it is computed by (3' = /3N' where 
/3 = (1 _ pt)/pt = 1 and N' = NqB + 3.5P„ by ^ 
with Prx — 3.3333 x lO^^G using the cumulative distribution 
function of the received signal power in (O without fading. 

Table HI] and HU] show the detection results for deployment 
scenario 1 and 2 in Fig. |6(b)| -(c), respectively. The letters H, C, 
F, and M stand for hit, correct rejection, false alarm, and miss, 
respectively. Note that F and M are the erroneous detections. 
From Table [III it can be seen that a total of 1 miss and 3 
false alarms are induced under the RST-based approach with 
Rq = 1 km (which is the entire region of interest), and 2 

"'For example, the transmission power of the MICAz Mote, a commercial 
wireless sensor node by Crossbow Technology, Inc., is programmable in 8 
steps from -24 to dBm gl]. 



TABLE II 

Detection results for deployment scenario 1 in Fio. |6(b)| (the 
LETTERS H, C, F, AND M STAND FOR hit, correct rejection, false alarm, 
AND miss, RESPECTIVELY.) 



(a) RST-based approach (Rq = 1 krn) 



Time slot 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 


Node 1 


CHCCCCHCHCHCHCCHHCCH 


Node 2 


HHHHHCCCHCCHCCCCHHCH 


Node 3 


HHCCCCCHHIEHHHHC C C C C C 


Node 4 


CCCIEIHCHHC C C C H C C H|M|C C H 


Node 5 


HCHHHCCCCCCHCCHCCHCH 


Node 6 


HCCHHCCCHCCCHHCHHHHC 


Node 7 


HHCCHCHCCCHCHIEICCHHHC 


Node 8 


CHHHHHHHCCHHHCHCHHHH 


(b) RST-based approach (i?o =0.5 km) 


Time slot 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 


Node 1 


CHCCCCHCHCHCHCCHHCCH 


Node 2 


HHHHHCCCHCCHCCCCHHCH 


Node 3 


HHCCCCCHHIEIHHHHCC C CC C 


Node 4 


CCCC|M|CHHCCCCHCCH|M|CCH 


Node 5 


HCHHHCCCCCCHCCHCCHCH 


Node 6 


HCCHHCCCHCCCHHCHHHHC 


Node 7 


HHCCHCHCCCHCHCCCHHHC 


Node 8 


CHHHHHHHCCHHHCHCHHHH 


(c) Classical approach using a bank of matched filters 


Time slot 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 


Node 1 


CHCCCEHCHEHCHCCHHCCH 


Node 2 


HHHHHCCCHIBCHCCCCHHCH 


Node 3 


HHCCCCCHHEHHHHCC C CC C 


Node 4 


CCCCMCHHCIBC ClMlC C HlMlC C|H1 


Node 5 


HCHHHCCCCCCHCCHCCHCH 


Node 6 


IMICCHIMICCIEHIEC C H HC H H HH C 


Node 7 


HHCCHCHCCIFIHC H|F|C C H HH C 


Node 8 


CHHHHHHHCCHHHIEHCHHHH 



misses and 1 false alarm are occurred with i?o = 0-5 km. 
Under the classical approach, however, a total of 6 misses and 
IQ false alarms occur, which is 4 times more than the RST- 
based approach with Rq ~ 1 km and 5.3 times more than that 
with Rq = 0.5 km. 

From Table HUl we can see that a total of 2 misses and 4 
false alarms are induced under the RST-based approach with 
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TABLE III 

Detection results for deployment scenario 2 in Fig. |6(c)| 



(a) RST-based approach (Rq = 1 km) 


Time slot 


12 3 4 5 6 7 8 9 10 1 1 12 13 14 15 16 17 18 19 20 


Node 1 


CCCCCCHCHHCHHCHHHHCH 


Node 2 


HCHHCIBCHCCHCCIEHCCHCC 


Node 3 


HCHHHHHIMICCCCHHCCHCCC 


Node 4 


HCHHHHCCHHCIMICCHHCCCH 


Node 5 


HCCHHHIEIHHCHHCHCCHCHH 


Node 6 


HCCHCHHHHHHHHCCIFICHCC 


Node 7 


CHCHHCCCHHHCCCHHHCCC 


Node 8 


HCHCCHCHCCHHHCHHHHHH 


(b) RST-based approach (Ro = 0.5 km) 


Time slot 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 


Node 1 


CCCC C CHCHHCHHCHHHHCH 


Node 2 


HCHHCCCHCCHCCCHCCHCC 


Node 3 


HCH|MIIMIHH|M|CC C C HHC C|MIC C C 


Node 4 


hchPIMIHIc chhcHIcchPccch 


Node 5 


HCCHHH|F|HHCHHCHCCHCHH 


Node 6 


HCCHCHHHHHHHHCCCCHCC 


Node 7 


CHCHHCCCHHHCCCHHHCCC 


Node 8 


HCHC C HCHCCHHHCHHHHHH 


(c) Classical approach using a bank of matched filters 


Time slot 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 


Node 1 


CCCCCCHCHHC HHCHHHHCH 


Node 2 


HCHHCECHCCHCCIBHCCHCC 


Node 3 


HCHHHHHIHICCC CHHCCHCCC 


Node 4 


HCHHHHCCHHCIHICCHHCCCH 


Node 5 


HCCHHHEHHCHHCHCCHCHH 


Node 6 


HECHCHHHHHHHHCCIECHCC 


Node 7 


CHCHHCCCHHHCCEHHHCCC 


Node 8 


HEHCCHCHCCHHHCHHHHHH 



i?o = 1 km, and 9 misses and 1 false alarm are occurred with 
i?o = 0.5 km. Overall, by reducing the discovery range Rq, 
the occurrence of false alarm is reduced, whereas that of miss 
is increased. Note, however, that most of the misses are due 
to the nodes outside the discovery region (i.e., node 3 and 4). 
In fact, the detection of the nodes inside the discovery region 
is more accurate than before. These are because the density 
/,/(•) gives more weight to the smaller size of the set Xi 
during the decision process. Also, it should be mentioned that 
the nodes outside the discovery region can be detected in a 
particular reception since there is no absolute and deterministic 
boundary for a node detection. On the other hand, under the 
classical approach, a total of 2 misses and 1 false alarms occur, 
which is 1.5 times more than the RST-based approach with 
Rq = 1 km and is tantamount to that with Rq = 0.5 km. Note 
again that most of the errors under the RST-based approach, 
specifically the misses, come from detecting the nodes outside 
the discovery region which is actually a preferred error. 

VII. Concluding Remarks 

In this paper, we studied the problem of neighbor discovery 
in a wireless sensor network. By incorporating physical layer 
parameters, we enabled a more accurate and realistic perfor- 
mance assessment of the chosen neighbor discovery algorithm. 
Unlike the collision channel, such incorporation required us 
to explicitly specify the set of transmitting neighbors in each 



time slot based on the received signal. With the aid of the 
theory of random set, we were able to present an alternative 
method to the classical approach using a bank of matched 
filters for detecting the set of transmitting neighbors. The 
performance gain of using this new method comes in an 
additional cost of complexity. Several steps are still needed 
to complete our work. To fully validate the advantages of the 
alternative method, the performance evaluation needs to be 
supplemented with additional simulations. Also, it should be 
noted that we focused on discovering unidirectional links as 
in most of the previous work. However, for routing and other 
important functions of a network, bidirectional links simplify 
the network operation. Therefore, it is of interest to develop a 
self-organizing protocol which establishes bidirectional links 
in a distributed manner. 

Appendix A 
Fundamentals of the Theory of Random Set 

Random set theory (RST) and its associated finite-set statis- 
tics (FISST) are extensively studied in the book Mathematics 
of Data Fusion llOj . This section briefly introduces the 
essentials of RST, and refer to [10] and other companion 
publications ifTTl - lfTSl for more details. In RST, the belief 
mass of a random finite set X plays a similar role to that 
of the cumulative distribution function of a random variable, 
and is defined as 

/3x(C) ^Pr{XC C} (22) 

where C is a closed subset of the space §. For example 
lfT3l . if X = {x}, i.e., a singleton, where x is a random 
vector, ,3x(C) = Pr{X C C} = Pr{x e C} where 
Pr{x e C} is the probability measure on S. From this, it can 
be conjectured that the belief mass generalizes the ordinary 
probability measure. It is straightforward to write the belief 
mass in (l22l i as 

/3x(C) = MX = B} = ^ /x(B) (23) 

BCC BCC 

where /x(') is the belief density of a random set X, and it 
plays the role of a probability density function. One natural 
question is how to derive /x(') from /3x(-) which will be 
answered in the sequel. 

Consider the case where the hybrid space § is comprised 
only of a finite discrete space U . Then, the belief density 
/x(-) of ^ random finite set X can be obtained via the Mobius 
inverse transform of /?x(-) as 

/x(B) - (-l)"'^'"/^x(C) (24) 

CCB 

by viewing sets as points in another space. 

Example A.l: Take S = [/ = {a,b}. Then, V{^) = 
{0, {a}, {6}, {a, 6}}. Assign probability to each element of 
the power set so that the sum is equal to 1: 

./x(0) = 0.1, /x({a}) = 0.4, /x({6}) = 0.3, /x({a, h}) - 0.2 

Then, we can obtain the values of the belief mass using ( |23]) . 
For example, /3x({6}) = /x(0) + /x({6}) = 0.4. Likewise, 
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we have 

/3x(0) = 0.1,/3x({a}) = 0.5,/3x({a,fe}) = 1 

We can also retrieve the values of the belief density from the 
belief mass using ( l24b . For example, fx{{b}) = — /3x(0) + 

/?x(m) = o.3. 

For a general case where § = K'' x [/ with d > 0, the Mobius 
inverse transform is not applicable because it appUes only for 
a finite partially ordered set |10|. The continuous analog of 
the Mobius inverse transform, which is often called the set 
derivative, at Z = {zi, . . . , z„} with zi ^ ■ ■ ■ z„, is defined 
by Ml 



and /2,i = 0. Using these probabilities, we can compute the 
numerator as 

Pi-{kelt,\It\ = ht,rk^r'} 
J 



E 

n=ht 



n-ht 



P^(l-PT)''-"/i,„/.(r') (27) 



SF{C) 



dZ 



5Zn ■■■Szi 



(C)^ 



IF 



dZn SZn- 



1 • • ■ 



Szi 



(C) 



where 



Similarly, the denominator can be expanded as 

PT{\rt\^ht,rk^r'} 

PT{\n\^ht\rk^r',\It\=n} 

■PT{rk^r',\lt\^n} (28) 



E 

n=ht 



6F{C) ^ F(CUF,J-F(C) 
= lim — 



5zi i/(_E,j^o v{Ez,] 

where E^^ is a small neighborhood of Zi and i^(-) is the hyper- 
volum( 
F(C). 



volume (i.e., Lebesgue measure) of a given set, and ^^g^^ — 



Appendix B 
Derivation of (fT2]) 



Denote by J'n the set of elements in the power 
set V{J) whose cardinality is equal to n, and 
Jni'^ <fri) the set of elements containing a specific 
node index fc. For example, if J' = {1,2,3}, then 
V{J) = {0,{1},{2},{3},{1,2},{1,3},{2,3},{1,2,3}}, 
J2 = {{1,2}, {1,3}, {2, 3}}, and = {{1, 2}, {1, 3}}. 
Using these notations, the first probability in the summation 
of ( |28] l can be split into 



The details in deriving ( fT2] i is delivered in this section. By 
applying Bayes rule to the conditional probability, we have 



\\\\=hu\t^S\ri,=r' ,\\t\=n\ 



_ Pr{fceIMIf|^fet,rfc=r^} 
Pr{|If| =/it,rfe =r'} 



(25) 



Px{\\\\=ht\rk = T',\\\=n} 

= J2 Mk e I?, 

+ Pi-{k i I?, \\\\ = hult =S\tu= r', - n} 



E 



Pr{|I| 



ht, I* 



n} (29) 



whose numerator and denominator are specified one by one 
in the following. Conditioning the numerator on the number Since nodes are randomly transmitting with equal probability. 



of transmitters and applying Bayes rule once again yields 

Pr{fcelMl?|=/^„rfe = r'} 

Pr{lI^l = /.,|A;eI^rfc = rMl,|=n} 

■PT{kellrk^r',\lt\^n} (26) 



J 



n=ht 



each realization of the set of transmitters having same cardinal- 
ity are equiprobable, i.e., Pr { It = 5 | jlfj = n } = l/(^), for 
all S € Jn, and the set of transmitters It itself is independent 
over time slots (but the set of successful transmitters If is 
not). Therefore, the probability in the first summation of i 
is computed as 

Pr{fc G II = hult ^S\tu= r', |It| = n} 



Define /i „ as the probability that a transmitter located at 
distance r' from the receiver will succeed among n{> 1) 
simultaneous transmissions and it is given by 



n-l 
ht~l 



/^;^'(l-/2,.)"-''7l,np7 (30) 
\nJ 

for all S G JT,^. Likewise, we can specify the remaining prob- 
abiUties in By noting that \J,,\ = (;^) and | = 
and after some manipulation, ( |29] l becomes 

Pi-{\lt\ ^ htln = r\\It \ = n} 



fl.r. 



1 



\ 1=2 



r']dFp{x2)---dFp{xr,) 



Also, define f2.,i as the probability that a transmitter at an 
arbitrary distance will succeed among n{> 1) simultaneous 
transmissions given that one of the other transmitters is known 
to be located at distance r' from the receiver which is obtained 
by 



/2.n = 1- 



J ■ ■ ■ J rE dFp{x2\r')dFp{x3) ■ ■ ■ dFp{xn) 





n 1 n 




l\ht 


n 


/n - 1 


J 


I ht 



/2'';ji-/2,«r-'''-'(i-/i,n) 
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where /„ is the shorthand notation for O. The second 
probabiHty in ( |28] ) is simply 

PrK = r', |I,| - n} - Qpt"(1 - PT)^-"/,.(r') (32) 

By substituting OTI ) and ( l32b into ( |28] ). the denominator 

becomes 

Pr{|I?| = /it, rfe-r'} 

= E (7 + ^« - + (;^)pt"(1 -PT)''-"/.(r') 

(33) 

where the shorthand notations C„, 7„ and ^„ are defined in 
Section HV^ 
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